PhD Research Proposal - Visualising Software Corpus Analysis

نویسنده

  • Craig Anslow
چکیده

Despite the spread of software development and software usage, we have almost no dependable data on how software is actually written in practice. Understanding the shape of existing software is an important step to understanding what good software looks like. Our proposal is to undertake quantitative studies of the way software is actually written in practice and evolved over time by collecting large corpora of software in object-oriented and aspect-oriented programming languages. We will then create tools to produce visualisations of the structure and behaviour of the software using visualisation techniques to characterize each language’s characteristic patterns of usage and frequency. In other disciplines, e.g. applied linguistics [12], this kind of approach is well established. Our proposed kind of corpus analysis will expose how programmers actually use languages, what features of languages are used, and better inform programming pedagogy, software language design, and software understanding. A key observation leading to this proposal is that software that could comprise such corpora have become available for study only in the last decade. One source of corpora will be free and open-source software (FOSS) that is freely accessible over the Internet.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

PhD Research Proposal Aspects to Support the Software Process

We have suggested [14] using the aspect-oriented paradigm to support the software development process by composing (weaving) aspects into the software development environment. In the first part of our research we have defined aspects to support the Extreme Programming (XP) development methodology and then wove them into the Eclipse platform. Several directions for future research have been iden...

متن کامل

Praaline: Integrating Tools for Speech Corpus Research

This paper presents Praaline, an open-source software system for managing, annotating, analysing and visualising speech corpora. Researchers working with speech corpora are often faced with multiple tools and formats, and they need to work with ever-increasing amounts of data in a collaborative way. Praaline integrates and extends existing time-proven tools for spoken corpora analysis (Praat, S...

متن کامل

Ph.D. Proposal: Automatic Repair of Loops

This PhD topic is about automatic software repair. Automatic software repair is the process of fixing software bugs automatically. Research on automatic software repair has recently started, esp. since the invention of GenProg, an automatic repair system for C code [3]. We have been successfully contributing to this field [4, 5, 6, 1]. The PhD student will explore how to automatically repair a ...

متن کامل

Explora: Infrastructure for Scaling Up Software Visualisation to Corpora

Visualisation provides good support for software analysis. It copes with the intangible nature of software by providing concrete representations of it. By reducing the complexity of software, visualisations are especially useful when dealing with large amounts of code. One domain that usually deals with large amounts of source code data is empirical analysis. Although there are many tools for a...

متن کامل

Explora: Tackling Corpus Analysis with a Distributed Architecture

When analysing a corpus of software, researchers often ask questions that entail exploration and navigation, such as “what packages contain fat interfaces in open-source systems?”, “how consistently is the code being commented?” and “are naming conventions being followed?”. The answers to these questions can impact software maintainability and evolution. Software visualisation can be of aid to ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008